Fix GPT-OSS Harmony format end token handling crash #15195

trvon · 2025-08-09T17:53:38Z

Summary

Fix a server crash that occurred when using GPT-OSS models with tools. The crash was caused by unconsumed <|end|> tokens in the chat parser.

Changes

Module: `chat-parser`

Problem: The server crashed with "Unexpected content at end of input" when tools were used with a GPT-OSS model.
Root Cause: `common_chat_parse_gpt_oss()` consumed the remaining content only when `parse_tool_calls=false`, so the `<|end|>` token was left unconsumed when `parse_tool_calls=true` (tools enabled).
Solution: Modified the parser to consume all remaining content in both `parse_tool_calls` cases.
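The failure mode described above can be illustrated with a small, self-contained sketch. This is not the llama.cpp implementation: `ToyParser`, `parse_buggy`, and `parse_fixed` are hypothetical names invented for illustration. The only assumption carried over from the description is that `finish()` throws when input remains unconsumed.

```cpp
#include <cstddef>
#include <stdexcept>
#include <string>
#include <utility>

// Hypothetical stand-in for a chat message parser; NOT the llama.cpp class.
class ToyParser {
public:
    explicit ToyParser(std::string input) : input_(std::move(input)) {}

    // Return everything from the cursor to the end and advance the cursor.
    std::string consume_rest() {
        std::string rest = input_.substr(pos_);
        pos_ = input_.size();
        return rest;
    }

    // Mirror the reported failure: leftover input is treated as an error.
    void finish() const {
        if (pos_ != input_.size()) {
            throw std::runtime_error("Unexpected content at end of input");
        }
    }

private:
    std::string input_;
    std::size_t pos_ = 0;
};

// Buggy shape: the rest is consumed only when tool parsing is off.
inline std::string parse_buggy(const std::string & input, bool parse_tool_calls) {
    ToyParser p(input);
    std::string content;
    if (!parse_tool_calls) {
        content = p.consume_rest();
    }
    p.finish(); // throws if parse_tool_calls is true and input is non-empty
    return content;
}

// Fixed shape: the rest is consumed regardless of the flag.
inline std::string parse_fixed(const std::string & input, bool parse_tool_calls) {
    (void) parse_tool_calls; // kept only to mirror the buggy signature
    ToyParser p(input);
    std::string content = p.consume_rest();
    p.finish(); // never throws here, because nothing is left over
    return content;
}
```

In this sketch, `parse_buggy("hi<|end|>", true)` throws while `parse_fixed` returns the whole input for either flag value. Whether the real parser should return the trailing token as content or drop it is a separate decision that the sketch does not make.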

Manual Validation

Before Fix:

# Server crash with GPT-OSS + tools
curl -X POST http://127.0.0.1:8080/v1/chat/completions \
  -d '{"messages": [{"role": "user", "content": "What is weather?"}], "model": "gpt-oss-20b", "tools": [...]}'
# Result: Server crash - "Unexpected content at end of input"

After Fix:

# Same request now succeeds
# Result: {"choices":[{"message":{"role":"assistant","reasoning_content":"...","content":""}...}]}

The GPT-OSS model uses OpenAI's Harmony format, which includes an <|end|> token after the final message. The parser did not consume this token, so finish() threw 'Unexpected content at end of input'. This change modifies the GPT-OSS parser to strip the <|end|> token from the content if present, and adds tests for the GPT-OSS format with and without the end token to ensure backward compatibility.
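As a rough illustration of the stripping step, the helper below removes a single trailing `<|end|>` marker from a string. It is a sketch under stated assumptions: `strip_end_token` is a hypothetical name, it requires C++17 for `std::string_view`, and it assumes the token appears only at the very end of the content. The actual change in `common/chat.cpp` may be structured differently.

```cpp
#include <string>
#include <string_view>

// Hypothetical helper, not part of llama.cpp.
// Removes one trailing "<|end|>" marker if present; otherwise returns the
// input unchanged, so content without the marker keeps working.
inline std::string strip_end_token(std::string_view content) {
    constexpr std::string_view end_token = "<|end|>";
    if (content.size() >= end_token.size() &&
        content.substr(content.size() - end_token.size()) == end_token) {
        content.remove_suffix(end_token.size());
    }
    return std::string(content);
}
```

Returning the input unchanged when the marker is absent is what keeps the "with and without the end token" cases compatible: callers do not need to know which form they received.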

Copilot

Pull Request Overview

Fix server crash when using GPT-OSS models with tools that was caused by unconsumed <|end|> tokens in the chat parser.

Modified GPT-OSS chat parser to properly handle <|end|> tokens in both tool and non-tool scenarios
Added reasoning format parameter passing from templates to server configuration
Added comprehensive test coverage for GPT-OSS Harmony format parsing edge cases

Reviewed Changes

Copilot reviewed 5 out of 5 changed files in this pull request and generated 1 comment.

Show a summary per file

File	Description
common/chat.h	Added reasoning_format field to common_chat_params struct for template-specific configuration
common/chat.cpp	Fixed GPT-OSS parser to consume remaining content and handle `<
tools/server/utils.hpp	Added reasoning_format parameter passing from chat templates to server parameters
tools/server/server.cpp	Updated server to use template-provided reasoning format with proper fallback logic
tests/test-chat-parser.cpp	Added comprehensive test suite for GPT-OSS Harmony format edge cases and token handling

Copilot · 2025-08-09T17:54:13Z

common/chat.cpp

+    } else {
+        // No <|end|> token, consume everything remaining
+        if (!builder.syntax().parse_tool_calls) {
+            builder.add_content(builder.consume_rest());


This line is unreachable because it's inside an else block that follows a return statement that was removed. The logic flow suggests this should be part of the else clause for the if (!builder.syntax().parse_tool_calls) condition.

I made changes that address this in subsequent commits.

trvon · 2025-08-09T19:40:02Z

I see. I am merging the changes from #15181 with my own in another branch.

trvon added 2 commits August 9, 2025 12:45

Fix server crash when using GPT-OSS models with tools that was caused…

730c0d9

… by unconsumed <|end|> tokens in the chat parser.

Copilot AI review requested due to automatic review settings August 9, 2025 17:53

trvon requested a review from ngxson as a code owner August 9, 2025 17:53

github-actions bot added the testing, examples, and server labels Aug 9, 2025

Copilot AI reviewed Aug 9, 2025

trvon added 2 commits August 9, 2025 14:01

fix GPT-OSS end token content consumption logic

8c45ee7

simplify GPT-OSS parser to always return content

05002b5

trvon closed this Aug 9, 2025
